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Abstract. We have employed feedforward neural networks trained on synthetic spectra in the range 3800 to 5600 A 
with resolutions of 2-3 A to determine metallicities from spectra of about 1000 main-sequence turn-off, subgiant 
and red giant stars in the globular clusters M55 and cj Gen. The overall metallicity accuracies are of the order 
of 0.15 to 0.2 dex. In addition, we tested how well the stellar parameters log g and Tefj can be retrieved from 
such data without additional colour or photometric information. We find overall uncertainties of 0.3 to 0.4 dex 
for log g and 140 to 190 K for Tefj. In order to obtain some measure of uncertainty for the determined values 
of [Fe/H], log g and Teff, we applied the bootstrap method for the first time to neural networks for this kind 
of parametrization problem. The distribution of metallicities for stars in ui Gen clearly shows a large spread in 
agreement with the well known multiple stellar populations in this cluster. 
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1. Introduction 

The advance in modern spectroscopic techniques coupled 
with 8-meter class telescopes makes it possible to simul¬ 
taneously obtain spectra for a large number of stars in 
clusters. Although such spectra as analysed in this work 
do not have high enough resolution to make detailed line 
analyses, they offer the possibility to measure abundances 
with good precision (~ 0.2 dex). This permits, for exam¬ 
ple, statistically significant analyses of stellar populations 
in dwarf spheroidal galaxies or studies of the intra- and 
intercluster abundance variations of globular clusters. 

While the data acquisition and subsequent reduction 
are themselves challenging, the analysis of such a large 
amount of data can be efficiently handled by the intro¬ 
duction of automated methods. This can be as accurate 
as spectral analyses (using line indices etc.) carried out by 
a human expert, a s proven in several ear lier works (see e.g. 
ISnider et alJl200lI ISoubiran et al.lll998h . Several of these 
works show that the accuracy of the results was limited 
by the quality of the data rather than by the automated 
algorithm used to analyse them. 
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* Based on observations obtained at the European Southern 
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Recent studies using automated methods to retrieve 
stellar parameters Teff, log g and [Fe/H] from observed 
data were limited in various ways. (In the following 
we call this process ‘pa rametrization’.) For example, 
iBonifacio fc Caffiii] ll2003tl employed a minimum distance- 
based classifier to high resolution spectra of stars in the 
Sagittarius Dwarf Spheroidal but were restricted to cer¬ 
tain luminosities and temperatures. Earlier efforts, espe¬ 
cially those using neural networks, were also limited in 
their analysis of errors of the determined stellar parame¬ 
ters. Most often, the precision with which stellar param¬ 
eters can be retrieved is evaluated by using a validation 
set with known target values. While this is correct to get 
some estimate of the overall uncertainty, this technique 
cannot be applied to previously unseen data, where such 
additional information is of course not available. However, 
any parameter estimate is only meaningful if some direct 
measure of uncertainty can be applied. To overcome this, 
we employed the bootstrap method to neural networks 
which allows us to assign individual error bars to the es¬ 
timated stellar parameters. 

ISnid~ et a,lJ (l2nnill . using neural networks on spec¬ 
tral data with resolutions similar to those in this work, 
could determine all three stellar parameters (Teg, log g 
and [Fe/H]) well. Indeed, they showed that the precisions 
are similar to those that can be obtained from fine anal¬ 
yses of high resolution spectra. While their neural net- 
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Fig. 1. Examples of observed spectra. The left panels show the spectra as obtained by the blue grism, the right column 
that of the red grism. The upper panel shows spectra of a red giant star, the lower for a main sequence star in M55, 
i.e. [Fe/H] = —1.8 dex. Note the larger number of absorption lines in the spectrum of the cool giant. In the lower plot 
the wavelength ranges used for the neural network analysis are indicated. 


work training and validation sets contained spectra from 
spectroscopic observations, our work relies entirely on syn¬ 
thetic spectra as templates. Of course, ‘real world’ spectro¬ 
scopic data with stellar parameters assigned from spectral 
synthesis analyses are the ideal case but are expensive in 
terms of observing time and effort. In this work we demon¬ 
strate that synthetic spectra in combination with auto¬ 
mated methods can be used to determine reliable stellar 
parameters from spectroscopic observations. 

We examine spectra from stars in the globular clusters 
M 55 and w Centauri. These clusters serve as a demon¬ 
stration of our methods. We have recently obtained a 
much larger database of stellar spectra in several globular 
clusters, and these will be subject to an extensive anal¬ 
ysis in the future. M55 has a metall i city of about —1.80 
dex llZinn fc Westlll984 iHarrislITQQfii iRichter et alJll99^ 

and is used primarily to validate our method, uj Cen, in 
contrast, is known to harbour several stellar populations 
with different kinematical characterist ics and a metallicity 
range from —2.5 to —0-5 dex (see e.g iNorris fc Da Costal 
Il995i ISuntzeff fc Kraftlll996|]. suggesting an entirely dif¬ 
ferent formation history to that of normal globular clus¬ 
ters. Indeed, several earlier spectroscopic and photomet¬ 
ric studies found a prolonged formation period of about 
2 to 5 Gyr, suggesting that this object is more likely the 


nucleus oi an accret 
ter JLee et al.lll999l 

ea awari gaiaxv tnan a 
Hilker & riichtleill200nl 

ffiODuiar cius- 
iHughes et al.l 

l2004 and especially 

Hilker et alJ2004|). In addition, it was 


found that there is a clear trend of the a element abun¬ 
dance s with the overa l l met allicity (see e.g. lPancino et abl 
l2002l or lOriglia et al ] l2nn,‘^] . The populations in uj Cen 


can be subdivided into a dominant metal-poor compo¬ 
nent with a metallicity peak at —1.7 dex which accounts 
for about 70% of the stars. Some 25% of the stars belong 
to the intermediate metallicity population with [Fe/H] ~ 
— 1.2 dex while the la st 5% are me tal-rich stars with [Fe/H] 
~ —0.6 dex (see e.g. ISmithl li^. 


The purpose of this work is to demonstrate the abil¬ 
ity of automated methods (here neural networks) to de¬ 
rive accurate metallicities as well as surface temperatures 
and gravities from spectra with medium resolutions of ~ 
2-3 A. Sect. 121 and 121 give a summary of the acquisition, 
reduction and selection of the observations, while Sect.^ 
describes the set of synthetic spectra. The training pro¬ 
cedure of the neural networks and the application of the 
bootstrap method are described in Sect. 121 In Sect. El the 
networks are validated on (observed) spectra of stars in 
M 55 (with known metallicity and log g and T^s estimated 
from isochrones) as well as specific stars with known stellar 
parameters. The same networks are then applied to stellar 
spectra in io Cen in order to derive preliminary metallic¬ 
ity values. Since this cluster is indeed very peculiar, we 
do not attempt to interpret these results in great detail. 
Rather, we want to show that the networks can yield re¬ 
liable results across a large range of abundances for stars 
in different evolutionary states. 
















































Willemsen et al.: Analysis of medium resolution spectra 


3 




velocity error [km/s] velocity error [km/s] 


Fig. 2. Distributions of the velocity errors of the red grism 
(left column) and blue grism (right column) for the spec¬ 
tra of M55 (top panels) and w Cen (bottom panels). The 
errors in the red grism are somewhat larger because of a 
smaller number of prominent absorption lines available for 
the cross-correlation procedure. 


2. Observations and reductions 


2.1. Data acquisition 

The data were obtained on the nights 7-12 May 2002 at 
the VLT at ESO/Paranal (Chile) in visitor mode. The in¬ 
strument was the FORS2 low dispersion spectrograph in 
combination with the mask exchange unit (MXU) and a 
4 X 4 k MIT CCD with a field of view of 6.8 square ar- 
cminutes. On average, about 60 stars were observed per 
mask, w ith the candidates chos en from Stromgren pho¬ 
tometry ( Hilker fc H.ichtlerl2nnflll . The slit width was fixed 
to 1" and the slit length ranged f rom 5 to 7". The s eeing 
was always b e tter t han 1". While lllilker et rdl ll2004ll and 
iKavser et al ] ll2005l) examined the main-sequence turn-off 
and SGB stars, the present work includes observations of 
the red giant and upper main sequence stars in both oj Cen 
and M 55. Each mask was observed through a ‘blue’ grism 
(ESO 600I-I-25, second order) and a ‘red’ grism (ESO 
1400V-I-18) with maximum wavelength coverages of 3690 
to 4888 A and 4560 to 5860 A. The corresponding disper¬ 
sions are 0.58 A pix“^ and 0.62 A pix“^, respectively. The 
exposure times were chosen in such a way as to ensure that 
the minimum signal to noise ratio (S/N) per pixel was ~ 
40-50 for the majority of objects. For giant stars, the av¬ 
erage S/N was slightly larger, about ~ 70-80. In the same 
way, the observations for M 55 were carried out with differ¬ 
ent masks for main-sequence, subgiant and giant stars. In 
order to have reference spectra from stars with known stel¬ 
lar parameters, long slit spectra with the same grism/filter 
combinations were obtained for eleven bright stars with a 
typical S/N of about 100-150. These stars are referred 
to as ‘standard stars’ in the following. In addition to the 


science frames, bias, fiat field and wavelength calibration 
images were obtained during daytime. The log of the ob¬ 
servations reduced for this work is shown in Table Q A 
summary and outline of the reductions and analysis based 
o n line indices for the main sequence turn-off sta rs is found 
in iHilker et al ] ll2004^ and iKavser et al.l ll2005ll . It should 
be noted that the sample of observed stars is not complete. 
The focus of these observations was to get a large sample 
of main-sequence turn-off and subgiant stars which are 
best suited to break the age-metallicity degeneracy. The 
upper main-sequence and red giant stars primarily serve 
to provide additional constraints for isochrone fitting of 
the different (age/metallicity) populations in uj Cen. 

2.2. Data reduction 

The data reduction of the observations was done by var¬ 
ious tools within IRAF^. The images of each mask were 
bias corrected, (dome) fiat fielded and cleaned of cosmic 
rays. The image was then corrected for pixel-to-pixel vari¬ 
ations and in a next step the individual spectra were ex¬ 
tracted, both being performed by the ‘apall’ task. The 
sky was generally subtracted using adjacent regions in 
the same slit, but in a few cases it was necessary to take 
the background information from other slit regions in the 
mask. In these cases, the sky correction was performed 
by the task ‘skytweak’. Although normally satisfactory, 
this procedure did result in erroneous sky subtractions in 
some cases. We therefore excluded spectra reduced with 
‘skytweak’ from the analysis. Each stellar spectrum was 
wavelength calibrated by 10 to 15 lines of He, Ne, Hg and 
Cd and rebinned to a final dispersion of 1 A pix“^, yield¬ 
ing an effective resolution of 2 to 2.5 A. 

The continuum normalization of the spectra is cru¬ 
cial, since it can directly influence the parametrization 
performance. We tried several techniques to divide out 
the continuum. These included different combinations of 
line exclusion, median filtering techniques and continuum 
fitting functions. It was found that excluding the regions 
adjacent to the strongest absorption lines in combination 
with the IRAF task ‘continuum’ and careful adjustment 
of the various parameter settings therein yielded the best 
results. But we note that a good continuum fit is natu¬ 
rally harder to find for metal-rich stars ([Fe/H] ~ —0.5 
dex) due to heavy line blending. This should be kept in 
mind when interpreting the parametrization results for the 
more metal-rich stars. Furthermore, the S/N in some cases 
is not the same for the two grisms for the same object. 
We did not explicitly test how this influences the overall 
parametrization but the results suggest that this effect is 
of little importance. 

Since the slits cover a range of positions on the mask, 
the wavelength coverage is not the same for all stars. In 

^ IRAF is distributed by the National Optical Astronomy 
Observatories, which are operated by the Association of 
Universities for Research in Astronomy, INC., under cooper¬ 
ative agreement with the National Science Foundation. 
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Table 1. Table of the observations reduced and analysed for this work in addition to those reported bv iKavser et alJ 
tdOFt) . The different field names in uj Cen are abbreviated by their location in the cluster (e.g om-e corresponds to 
east). The ending ‘rgb’ refers to red giant stars, ‘to’ to main-sequence turn-off stars, ‘ms’ correspondingly to upper 
main-sequence stars. The field of view of a MXU mask is 6.'8 x 6f8. 


date 

mask 

02000 

^2000 

exp.time (red) [sec.] 

exp.time (blue) [sec.] 

2002/05/10 

om-n-rgb 

13:26:46.7 

-47:18:43.3 

1 X 120, 1 X 240 

2 X 300 

2002/05/11 

om-e-rgb 

13:27:48.1 

-47:27:27.4 

2 X 180 

2 X 240 

2002/05/11 

om-sl-rgb 

13:26:57.4 

-47:37:43.1 

2 X 180 

2 X 240 

2002/05/11 

om-n-ms 

13:26:46.7 

-47:18:43.2 

3 X 1080 

3 X 1200 

2002/05/12 

om-s2-rgb 

13:26:57.3 

-47:43:42.9 

2 X 180 

2 X 240 

2002/05/12 

om-s2-ms 

13:26:57.3 

-47:43:40.0 

3 X 960 

3 X 1200 

2002/05/12 

om-sl-ms 

13:26:57.8 

-47:37:43.7 

3 X 900 

3 X 1200 

2002/05/12 

om-e-ms 

13:27:48.1 

-47:27:30.8 

3 X 900 

3 X 1200 

2002/05/10 

M 55-to 

19:39:59.1 

-30:53:01.2 

3 X 600 

3 X 720 

2002/05/11 

M 55-rgb 

19:39:58.9 

-30:52:56.4 

2 X 180 

2 X 240 

2002/05/12 

M 55-ms 

19:39:58.9 

-30:52:59.0 

2 X 900 

2 X 900 


Table 2. Summary of the stars used for validating the network mod el. The [Fe/H], log q and Tpff v alues given are the 
averages from the works of several authors found in the literature fsee lCavrel de Strobel et ^l200lll . The uncertainties 
are either the standard deviations about the mean values (in the case that multiple values are listed) or the errors 
stated by the author (when there is only a single measurement). The last column shows the number of separate 
determinations in each case. 


ID 

[Fe/H] [dex] 

a ([Fe/H]) [dex] 

log g [dex] 

c(log sr)[dex] 

Teff [K] 

u(Teft) [K] 

num 

BD-032525* 

-1.90 

0.08 

4.00 

0.57 

5739 

40 

2 

BD-052678 

-1.89 

— 

4.43 

— 

5570 

— 

1 

BD-084501 

-1.59 

0.27 

4.00 

- 

5750 

210 

2 

BD-133442 

-2.95 

0.21 

3.92 

0.25 

6240 

75 

3 

HD089499** 

-2.14 

0.08 

2.92 

0.88 

4875 

100 

3 

HD097916 

-1.05 

0.19 

4.09 

0.10 

6222 

140 

5 

HD179626 

-1.26 

0.04 

3.70 

0.06 

5625 

40 

1 

HD192718 

-0.74 

0.04 

4.00 

0.06 

5650 

40 

1 

HD193901 

-1.08 

0.12 

4.38 

0.32 

5713 

70 

8 

HD196892 

-1.02 

0.10 

4.04 

0.27 

5900 

100 

4 

HD205650 

-1.12 

0.13 

3.93 

0.64 

5729 

90 

3 


* Spectroscopic binary 
** Star in binary system 


order to produce a homogeneous grid of all spectra, we 
restricted the wavelength range from 3850 to 5700 A with 
an overlap of the red and blue grisms’ contributions at 
4700 A. Moreover, for the analysis only certain wavelength 
ranges in the spectra were used, the regions being defined 
after extensive simulations on synthetic spectra. These are 
3900-4400 A, 4820-5000 A and 5155-5350 A. As expected, 
these regions, found by comparing the parametrization re¬ 
sults for networks trained on different ranges, contain the 
most prominent metal and hydrogen lines. The restric¬ 
tion to certain wavelength intervals and the exclusion of 
(hopefully) unimportant ranges minimizes the number of 
inputs to the network, thus increasing the ratio of training 
templates to free parameters in the network (see Sect. 
Examples of typical spectra are shown in Fig.^ 

3. Grid of observed spectra 

We determined stellar radial velocities by cross-correlating 
the spectra with suitably chosen template spectra. With 


these data at hand, we corrected the spectra of each star 
individually in each grism (blue and red). The radial ve¬ 
locities were also used as an extra indicator for cluster 
membership (in addition to the position of the star in 
the colour-magnitude diagram) by comparing them to the 
velocities of the clusters. These are 175 k m s~^ and 23 8 
km s“^ for M55 and uj Cen, respectively llHarri ilHi). 
In order to account for systematic effects not corrected 
in this data reduction (such as small mismatches between 
the wavelength calibration and science images), we chose 
rather large velocity intervals for the cluster membership 
decision, namely [100:200] km s“^ for M55 and [150:350] 
km s“^ for uj Cen. In this way about 6 % of all stars were 
excluded as being nonmembers. 

The distributions of the velocity errors from the red 
and blue grism are shown in Fig.|3 Each error combines 
the individual cross-correlating error for a given spectrum 
(as determined from the IRAF routine ‘fxcor’) and that 
of the corresponding template stars. 
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Some additional spectra were eliminated because of 
the presence of light polluting bright stars next to the 
slit. However, two effects coming from bright stars that are 
close to the observed field can cause erroneous sky subtrac¬ 
tion. First, there may be reflections that cause gradients 
in the background, and second, there may be a contami¬ 
nation in the wings of the PSF. Finally, some low quality 
spectra were also eliminated. 

We split our observed stars into two luminosity class 
categories based on the available Johnson photome¬ 
try: MSSGB (comprising all main-sequence and subgiant 
stars) and RGB (all red giant stars). This cut is sensible 
since the stellar parameters log g and Tea of these two 
groups are significantly different. Moreover, in order to 
improve the neural network performance, it is generally 
recommend ed to simplify the problem as much as possi¬ 
ble (see e.g. lHavkfnllQQflll . More explicitly, by splitting the 
regression problem (the mapping from inputs to outputs) 
into two parts (in terms of stellar parameters), we hope 
that the overall approximation of the underlying function 
by the network is improved by lifting possible ambiguities 
in the mapping function. Tests showed that the results are 
indeed better when considering two object categories in¬ 
stead of one. This is certainly also due to the different S/N 
of the MSSGB and RGB objects. As described in Sect.EJ 
the giants have generally higher S/N than the subgiants 
and main sequence stars. It was therefore necessary to 
train networks on different ranges in S/N (see below). For 
M 55 there are 130 stars in the MSSGB and 35 stars in the 
RGB sample, while for w Gen the numbers are 800 and 
66, respectively. 

Since we have synthetic training samples only for 
stars up to the helium flash, we removed all horizontal 
branch and suspected asymptotic giant branch stars in 
both our M55 and uj Gen samples. We will consider the 
parametrization of these in a future study, which will also 
include an analysis of a and GN abundances. 

4. Grid of synthetic spectra 

We have calculated synthetic (t emplate) spec t ra us- 
ing the model atmospheres from lGa,ste]]i et af.l (jl 
in combination with th e latest version of SPECTRUM 
llGrav fc Gorballvll 19941) along with its line list. Since our 
study focuses on globular clusters, the metallicity range of 
the calculated spectra was limited to — 2.5 dex < [Fe/H] 
< — 0.5 dex, while temperature and surface gravity are 
limited to 3500 K < Teff^ 8500 K and 0 dex < log g< 5.0 
dex respectively. Note that the grid is not complete, i.e. 
not all parameter combinations exist. The spectra were 
calculated with a step size of 0.02 A in the wavelength 
range 3850 to 5700 A. The microturbul ence was fixed at 2 
km s~ ^. Similarly to the procedure of iBailer- Jones et a,1.l 
(HHl), the spectra were dispersion-corrected and binned 
to have the same resolution and wavelength range as the 
observed data. 

As will be described in Sect. El we used spectra with 
different S/N to train the network models. To do so, we 


added the corresponding noise for a given S/N before nor¬ 
malizing the spectra. The wavelength of scaling the S/N 
was chosen at 4500 A which is in the overlap region of the 
two grisms. 

For the present tests we considered solar scaled a abun¬ 
dances, something which is empirically found to be only 
the case for halo obj ects with metallicities > — 1 dex (see 
e.g. lMcWifiia.mll 997|i . However, this does not pose a prob¬ 
lem since the parametrizer is explicitly trained only on 
metallicity as expressed by [Fe/H] (and not on a abun¬ 
dances). The regression model (the neural network) is 
therefore expected to focus on the changes as caused by 
different [Fe/H] values, thus ignoring spectral dependen¬ 
cies caused by other specific elements. 


5. Artificial Neural Networks 


Neural networks have proven useful in many scientific dis¬ 
ciplines for analysing data by providing a nonlinear map¬ 
ping between an input domain (in this case the spec¬ 
tra) and an output domain (the stellar parameters). For 
details on this subject with special e mphasis on stel¬ 
lar pa r ametrization see, f o r exa m ple. IWillemsen et a.lJ 
ll2003l). iBailer- Jones erall lll997l). Bailer-Jone^ ll2(j00l). 
ISnider et all l|200ll) . and especiallv lBailer-Jone 


for 

eneral overview. The software used in this work is that 



a general overv i ew, i n 
of IBailer-JonesI ( 1998h . 


A neural network can be visualized by several layers, 
where each layer is made up of several so-called nodes or 
neurons. In general, for the type of network used in this 
work, one distinguishes an input-layer, one or two hidden- 
layers, and an output-layer. The nodes in each layer are 
connected to all the nodes in the preceding and/or fol¬ 
lowing layer via some free parameters (‘weights’). While 
the nodes in the input layer simply hold the inputs to 
the network (i.e. the fluxes in the wavelength bins of 
the spectrum), each node in the hidden layer performs 
a weighted summation of all its inputs. This sum is then 
passed through a nonlinear ‘transfer function’ (here the 
tank function) and the result is then passed further to 
a node in the next layer. The output nodes do the same 
as the nodes in the hidden layer, except that the transfer 
function is here chosen to be linear. Before the network 
can be applied to predict the stellar parameters for previ¬ 
ously unseen (observed) spectra, it must be trained on an 
existing set of (here synthetic) spectra with known stellar 
parameters. During this training phase, the weights of the 
network are iteratively adjusted in order to accurately pre¬ 
dict the outputs (stellar parameters), based on the inputs 
(spectral fluxes). This weight adjustment is done accord¬ 
ing to some measure of error (‘cost function’). Here we 
use the sum-of-squares error calculated from the networks’ 
outputs and the corresponding true stellar parameters for 
a given synthetic spectrum. Once this error is sufficiently 
small, the training is stopped and the weights are fixed. 
The network can now be used to predict the stellar param¬ 
eters of a star based on the fluxes in a stellar spectrum. 
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There are several parameters in a neural network which 
must be adjusted, often by careful experiments. In the 
model used here these are essentially a weight decay term 
(which inhibits overhtting by a too complex network), a 
weighting term for each stellar parameter in the cost func¬ 
tion, the number of hidden layers, as well as the number 
of weights in each hidden layer and the number of training 
iterations. After extensive tests we found that a network 
with two hidden layers each having 13 neurons provided 
the best results. More neurons did not significantly im¬ 
prove the results but only increased the training time, 
while a smaller number of neurons resulted in a poorer 
model with larger errors. 


In principle, one could train three independent net¬ 
work models, each with a single output, for each of the 
three stellar parameters (Tea, log g and [Fe/H]). It is well 
known, however, that changes in the values of these stel¬ 
lar parameters may have similar effects on a stellar spec¬ 
trum (both lines and continuum). A special example is 
the helium abundance, where an increase of this element 
changes the line sh apes in the same way as does surface 
gravity llGravlll992l) . To account for such degeneracies we 
therefore trained the networks on all three stellar parame¬ 
ters simultane ously. As exp e cted f rom the above, but con¬ 
trary to what ISnider et a.lJ (| 2 nmli found, a network with 
multiple outputs performed better than one trained only 
on one single parameter. A possible reason for this might 
be that we used two hidden layers. It is generally known 
that a second hidden layer is important to model the ‘sub¬ 
tle’ information i n the data (he re metallicity and surface 
gravity), see e.g. iBishonl l|l99,^ . Networks with two hid¬ 
den layers might therefore be better suited for this kind 
of parametrization problem. 


5.1. Network training and regularization 


Table 3. Summary of the regularization results from neu¬ 
ral networks with different training sets and weight decay 
term 7 . Increasing 7 results in smaller weights of the net¬ 
work and thus a more regularized solution. However, too 
large a value of 7 will again result in larger deviations, i.e. 
there is a trade-off in setting this parameter. The metal¬ 
licity deviations for the standard stars are given in terms 
of the median values of the difference (computed value — 
literature value). It can be seen that training on noise-free 
data and validating on noisy data systematically underes¬ 
timated metallicities. The results demonstrate that noise 
in the network inputs can help improve the regularization. 


noise in training set 

7 

[Fe/H] offset 

no 

0.0001 

-0.16 

no 

0.001 

-0.11 

no 

0.01 

-0.07 

yes 

0.0001 

0.18 

yes 

0.001 

0.02 

yes 

0.01 

0.05 


serves as an additional regularization mechanism (noise 
being hard to fit). The results of these tests are summa¬ 
rized in TableOI 

We therefore trained the networks on sets of noisy 
versions of the training templates (synthetic spectra). A 
training sample of size is a collection of several stellar 
parameter combinations expressed by {i S [ 1 , A"]) and 
associated spectral flux vectors x^, where each of these is 
represented P times with different S/N ratios, i.e. 


/ \ 


( \ 

(®2;y2) 

where Xi = 


\ {xN]yn) / 


\ xr^ ] 


The observed spectra cover a range of different S/N. The 
neural network model (or the training data) should ac¬ 
count for this. Indeed, we found that networks trained on 
templates with high S/N and then validated on spectra 
with a much lower S/N increased the bias and (to a smaller 
degree) also the variance of the distribution of residuals. 
This b ehaviour was also found in tests by I Snider et al.l 
il 2 nnih . To investigate this further, we trained networks 
on different training sets with multiple noise versions of 
a spectrum (only one noisy spectrum at each S/N) for a 
given stellar parameter combination. In addition, we tried 
different values for the weight decay (regularization) pa¬ 
rameter 7 . We found that a network trained only on high 
S/N templates (S/N =100) can reproduce the stellar pa¬ 
rameters for spectra with different S/N reasonably well, 
but only if the weight decay term is chosen to be rather 
large. This shows that this is a regularization problem, i.e. 
a network trained on high S/N spectra will overfit the data 
unless ‘restrained’. When using multiple noisy versions of 
a training template the results improved slightly, which is 
understandable since additional noise in the training data 


For the MSSGB case (main sequence and subgiants) we 
used synthetic spectra with 3.5 dex < log g < 5.0 dex, 
4000 K < Teff< 8750 K, -2.5 dex < [Fe/H]< -0.5 dex 
and scaled to S/N ratios between 10 and 100, in steps of 
10. For the RGB stars, the stellar parameters were 0.0 dex 
< log g < 3.5 dex, 3500 K < Teff< 6000 K and the same 
metallicity range but with S/N ratios in the range from 
50 to 150, in steps of 10. Note that there is an overlap in 
log g and Teg for the two samples which is necessary to 
properly test those stars with parameters at the edge of 
the grids. 

As mentioned above, the networks can also be tuned 
by a weighting term for each stellar parameter in the cost 
function. We used this term to set up two network models 
as follows. 

modell is a network tuned to yield good metallicity 
parametrization results. This network was trained on all 
three stellar parameters, i.e. [Fe/H], log g and Teg, but 
with special weight on metallicity. 

model2 is a network tuned to yield good log g and Teg 
performance. As modell, this network was trained on all 
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stellar parameters, but with a higher weight on surface 
gravity and temperature. 

Note that each model was trained on each of the two 
training sets (MSSGB and RGB) separately. Since the 
minimization process can become trapped in local min¬ 
ima during learning, we set up a committee of 10 net¬ 
works (with different initial weight settings) for each of 
the models. 

As described in Sect. o we used three disjoint wave¬ 
length ranges from the spectrum, yielding a total of 878 
wavelength bins (network inputs). Including the ‘bias’ 
weights in the input and hidden layers, the total number 
of weights is therefore 11648. This is large and it might 
be surprising that there are meaningful results given that 
there are only 4990 and 4818 training templates for the 
MSSGB and RGB cases, respectively. However, the effec¬ 
tive number of free parameters in the network is certainly 
much lower, because much of the spectrum contains re¬ 
dundant information. Furthermore, the smoothness of the 
input-output function which is to be approximated by the 
network will also lower the effective number of free param¬ 
eters. 

5.2. Error estimates from Bootstrapping neural 
networks 


outputs reasonably well, the latter source of uncertainty 
is not considered further. 

The bootstrap was introduced byli^illii^) for es¬ 
timating various sample properties such as bias, variance 
and confidence intervals on any population parameter es¬ 
timate. Given a (random) training sample S comprising 
pairs of inputs x and corresponding outputs y, i.e. S = 
(xi, yi), (x 2 , y 2 ),..., (xat, yAr), drawn from a population F, 
we want to estimate some parameter (e.g. the network 
weights) 9 = f{F). This could be done by calculating 
9 = y(S) based on S. The bootstrap is a data-based 
method for statistical inference which allows us to esti¬ 
mate the error in the network outputs from different val¬ 
ues of 9. To determine this bootstrap standard error, one 
needs to build bootstrap samples. A bootstrap sample, 
S*, is a random sample of the same size N as the orig¬ 
inal sample which is created by randomly resampling S 
with replacement (i.e. S* C S C F). In this way, one 
obtains B bootstrap samples, the 6th bootstrap sample 
given by (xf, yf), (x;^ yf ),..., y))?). For each sam¬ 
ple, we minimize “ 2/(^5 ^n)]^ (i-®- derive B 

regression functions by training a network on each of the 
B bootstrap samples) to yield 9*^ in each case. The (non- 
parametric) estimate of the bootstrap standard error, BSE, 
is the square root of the variance of the distribution, which 
for the nth predicted value y„ is 


In most applications of neural networks to determine stel¬ 
lar parameters, uncertainties (error bars) on the estimated 
parameters are derived from the distribution of the param¬ 
eter residuals of some validation set with known target 
values. While this method is correct in order to get an 
overall estimate of the precision to which parameters can 
be determined, it does not yield individual uncertainties 
for each determined parameter. This is crucial, however, 
since objects that lie close to the boundary of the train¬ 
ing grid tend to have higher uncertainties, given that the 
performance of neural networks (or indeed any regression 
model) is ge nerally weak at the grid boundaries (see e.g. 
lBishoniri995l) . The concept of using the bootstrap method 
for the determination of error bars in the frame work of 
neura l ne tworks is discussed in severa l articles le.g. lHesk^ 
Il997l and lDvbowski fc Bobertij bOOOh. For the special case 
of stellar nara.metrization see lWiHemsen et~^ 

Since a neural network is a regression model that maps 
inputs onto outputs, we can use the concepts of stan¬ 
dard errors and confidence intervals also for this kind of 
parametrizer. There are basically two sources of uncer¬ 
tainty arising with neural networks. The first stems from 
the fact that the training data are noisy and incomplete, 
i.e. the construction of a training grid by (randomly) sam¬ 
pling templates is already prone to sampling variations. 
The second source of uncertainty is given by the model 
limitations which arise from the optimization failing to 
find the global optimum. Another source of error is an 
inappropriate (e.g. inflexible) parametrization model (e.g. 
linear). Since we assume that our network model approx¬ 
imates the underlying mapping from the inputs to the 


BSE(x) = 


\ 


B 


1 


[y(x„;d*'') - yboot(x„; •)]" 


( 2 ) 


where yboot(x„; ■) is shorthand for the bootstrap commit¬ 
tee’s prediction given by -g X]b=i v{'^n] 9*’^)- 

Note that the random resampling is done over the stel¬ 
lar parameters and not over the noise versions. 

It is generally recommended that the number of 
bootstrap samples be in the range from 25 to 200 
( Efron Tibshira.nill 99, “ih . For this work, we chose R = 30 
to save computational power. Limited tests showed that 
larger numbers did not significantly change the results. 
The random resampling was done with a uniform random 
number generator as given in IPress et all l|l992j) . 


6. Validation of the network model 

In the following, the results of the network validation are 
presented. As mentioned before, the metallicity results 
were obtained from network modell which was tuned to 
yield accurate results for this parameter. The results for 
surface gravity log g and temperature T^s were found from 
network model2, i.e. a network trained on all three stellar 
parameters as outputs but with a higher weight on log g 
and Teff (see Sect. 15.111 . 

Unless stated otherwise, we summarize the overall 
parametrization results in terms of the average absolute 
deviation (over some set of spectra), i.e. 

1 ^ 

A=j;^-J2\^{p)-T{p) 

p^l 


( 3 ) 
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Table 4. Table of Stromgren observations of M 55. The columns show the central coordinates of the observed fields and 
the exposure times in the different filters. Certain parts of these fields overlap with those of the FORS2 spectroscopic 
observations. 


date 

field 

02000 

^2000 

exp.time(a) 

[sec] 

exp.time(6) [ 

sec] 

exp.time(i/) [sec] 

2001/07/12 

M55-1 

19:40:19.0 

-30:59:30.0 

2 X 30, 3 X 

240 

1 X 20, 2 X 

90 

3 X 10, 3 X 60 

2001/07/13 

M55-2 

19:40:19.0 

-30:53:30.0 

1 X 30, 2 X 

240 

1 X 20, 2 X 

90 

1 X 10, 2 X 60 

2001/07/14 

M55-3 

19:39:47.1 

-30:56:01.1 

1 X 30, 2 X 

240 

1 X 20, 2 X 

90 

1 X 10, 2 X 60 


where p denotes the p**' spectrum and T is the target (or 
‘true’) value for this parameter. The quantity C{p) is the 
parametrization output averaged over a committee of 10 
networks (see Sect. El 

6.1. Results for M55 

In order to compare the networks’ outputs of surface grav¬ 
ity and temperature based on the observed spectra, we 
have to get an estimate of the ‘true’ values for these 
parameters. For this, we did not use the pre-imaging 
BV Johnson photometry (because of missing photomet¬ 
ric standard star observations and saturated giant stars), 
but rather decided to use more precise measurements of 
M 55 obtained in the Stromgren system. These data were 
obtained (in addition to observations in other filters) at 
the Danish 1.5 m telescope at La Silla on the nights 12-15 
July 2001. The instrument’s field of view is 6f3 x 6f3, the 
seeing was typically about 1"5. A summary of the data 
reduction is given in the next section. 


6.1.1. Stromgren photometry of M 55 


Initially the photometric calibration, relating instrumen¬ 
tal to standard magnitudes, was determi ned. For this we 
used observations of 12 E region stars I Jonch-SorensenI 
Il993l) which were reduced in a standard way (bias sub¬ 
tracted and flat fielded) and the instrumental magnitudes 
determined via aperture photometry. The extracted mag¬ 
nitudes were then combined in one calibration equation 
that takes the airmass of the images into account. The 
science images for a given filter were bias subtracted and 
flatfielded in a standard way. The corrected science frames 
were then averaged to increase the signal to noise ra¬ 
tio. Instrumental magnitudes were derived via PSF pho¬ 
tometry wi th the aid of the packages DAOPHOT and 
ALLSTAR I Stetsonll 987|i . A summary of the observations 
is given in Tabled] The calibrated data were finally ex¬ 
tinction corre cted using the transformat ion E{b — y)= 0.7 
• E{B — V) llCrawford fc Barii^ll97(1l and E{v — y) = 
1.68 • E{b — y) a s calculated from a synthetic extinction 
curve taken from iFitznatrickI (1 0 OOji in combination with 
the Stromgren filter transmission curves. Upper limits for 
the photometric errors are 0.02 mag for the y and b bands 
and 0.03 mag for the v band. Since the fields of the MXU 
and the Danish 1.5m observations only overlapped in cer¬ 
tain parts, photometric information was not available for 
all objects. 


6.1.2. Surface gravities and temperatures for stars in 
M55 

To determine log g and Teg from the Stromgren pho¬ 
tometry, we measured the distances in the three- 
dimensional v,b,y space between the observed magnitudes 
and the corresponding filter magnitudes of an appro¬ 
priate isochrone^^^br_tlns_weused^_&e isochrone taken 
from iRergbiisch X VandenRer (tra nsformed to 

Stromgren colours bv IPoberts & Clrebeil99,5ji with an age 
of 16 Gyr and [Fe/H] = —1.80 dex with an assumed clus¬ 
ter distan ce of 5.3 kpc and a reddening of E{B — V) = 
0.08 mag llHarri iiHoi). We found that these parameters 
could best fit the (6 — y):y and (v — y):y colour magnitude 
diagrams. The reason why we used Stromgren isochrones 
which are still on the old age scale is that we had no access 
to newly calculated models in this photometric system. 

We then ‘projected’ each observed star onto the 
isochrone by calculating the distances between the tab¬ 
ulated values of v, b and y of the isochrone and the 
corresponding observed magnitudes. A star was then as¬ 
signed the log g and Tgg value of the isochrone for which 
the calculated distance in this three-dimensional mag¬ 
nitude space is minimum. Stellar parameter uncertain¬ 
ties A (log g) and A (Teg) are found by defining a (three- 
dimensional) error ellipse from the photometric errors 
A(u), A(6) and A{y) and by calculating the distance to the 
isochrone for equally spaced steps on this ellipsoid. The 
errors of log g and Teg are then found from the result¬ 
ing maximum and minimum values for each parameter. 
This procedure is visualized in Fig.^ Note that because 
of the slope of the isochrone and the position of a particu¬ 
lar star in the colour magnitude diagram, these errors are 
not symmetric. 

Note that the age value of the Stromgren isochrone is 
offset by 2.5 Gyr from the 13.5 Gyr as determined by the 
best fitting (a-enhanced) isochrone for Johnson photome¬ 
try (Fig.El. To check for consistency, we did tests with the 
available pre-imaging Johnson photomet ry and the fa/Fe ] 
= 0.3 dex Johnson isochrone (taken from iKim et aLl2fl^2^ . 
We found that there were significant differences (of about 
^ 80 K) in the Teg values only for the upper main se¬ 
quence and turn-off region stars, while the log g differences 
were of the order of 0.1 to 0.2 dex. Since we only want to 
demonstrate the principal capability of the networks to 
derive surface gravity and temperature estimates, we did 
not investigate this further. 

The parametrization performances for log g and Teg 
for stars in M 55 for which Stromgren photometry was 
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(v-y)o [mag] 


Fig. 3. Photometry in M55. The upper plot is the pre¬ 
imaging Johnson BV photometry for a selected range 
of luminosities. Two isochrones both with 13.5 Gyr and 
[Fe/H] = —1.8 dex are overplotted. The left one has solar 
scaled a abundances, the right one has [a/Fe] = 0.3 dex. 
The lower plot shows v — y Stromgren photometry and an 
isochrone with [Fe/H] = —1.8 dex and 16 Gyr (the old age 
scale). The heavy points show the stars for which spectra 
have been obtained. 


available are presented in Fig.^l Note that these results 
were obtained from network model2, i.e. networks trained 
with a higher weight on log g and T^g. In order to appreci¬ 
ate the results one should keep in mind that the networks 
were trained on synthetic spectra with a singl e microtur¬ 
bulenc e value of ^ = 2 kms“^. However, as iGrav et all 
1 2001(1 showed, a range of different microturbulences is 
important in order to correctly estimate surface gravity 
values. It should also be noted that the Tes and log g val¬ 
ues as estimated from the isochrone have uncertainties of 
their own. This is especially the case for those stars that 
lie close to the main-sequence turn-off, where a perpen¬ 
dicular projection of a star onto the isochrone (see above) 
gives rise to larger uncertainties. 

From the plots, it can be seen that the parametrization 
yields reasonable results. The scatter is somewhat larger 
for high log g and Teff values, which is most likely due 
to the fact that the values determined from the isochrone 
are uncertain in these regimes. For T^g we observe a small 
systematic offset of about 80 K. Given the intrinsic un¬ 



Fig. 4. A sketch to help visualize the procedure to find 
log g and T^g values from the Stromgren photometry. The 
solid line is an isochrone in the three-dimensional magni¬ 
tude space, while the ellipsoid is a three-dimensional error 
box about the measured point. The size of the error ellip¬ 
soid has been magnified by a factor 15 and displaced for 
clarity. The surface gravity and temperature for the mea¬ 
sured point is found from the shortest distance between 
this point and the isochrone. The errors for these two stel¬ 
lar parameters are found by projecting the error ellipsoid 
onto the isochrone and taking the difference between the 
extreme values of T^g and of log g. This is shown in the 
two-dimensional projection on the b, v plane. 


certainties of the target values and the fact that these are 
only limited tests (i.e. we did not put too much effort into 
finding the optimum network configurations), we conclude 
that the results are close to what can be obtained from 
such data in general (see Sect. 16.311 . 

6.1.3. Metallicity results for M 55 

The metallicity values determined from the neural net¬ 
works for the combined MSSGB and RGB sample are 
shown in the left panel in Fig. El These results were ob¬ 
tained from network modell, i.e. networks trained on all 
stellar parameters but with a higher weight on metallic¬ 
ity, see Sect. 15. ll The overall parametrization performance 
is good. The mean value of the metallicity distribution is 
— 1.86 dex compared with a literature value of —1.80 dex. 
The overall error as defined in Eq.Elis 0.15 dex. 

6.2. Model validation for standard stars 

We compared the parameter determinations of the 
neural networks on standard stars with those found 


































10 


Willemsen et al.: Analysis of medium resolution spectra 



isochrone (log g) [dex] 




- 1.5 - 0.5 0.5 1.5 

Delta (log g) [dex] 



-600 -200 200 600 
Delta (Teff) [K] 


Fig. 5. Parametrization results for log g (upper line) and 
Teff (lower line) for stars in M 55. The left panels shows the 
parameter values as determined from the isochrone versus 
the corresponding computed value from the network for 
the two parameters. The solid line is the identity. The 
horizontal and vertical error bars for each point are cal¬ 
culated from the photometric error ellipse (see Fig. El) and 
from 30 bootstrap networks, respectively. The right panel 
shows the boxplots of the residuals for each parameter, 
Delta = (computed value - isochrone value). The solid 
line in the box shows the median value of the residuals 
while the borders of the box are the first and third quar- 
tiles. Single points which have a larger distance to the 
median than 1.5 times the boxlength and are identified as 
outliers. The overall parametrization errors as defined in 
Eq.El are 0.32 dex and 190 K for the two parameters. 


from other methods a nd li sted in the catalogue of 
ICavrel de Strobel et all ll200lll. In cases of multiple ob¬ 
servations (from different authors) in this catalogue, we 
calculated the (unweighted) mean and the corresponding 
standard error for those values which were determined 
from high resolution spectroscopy. A summary of the val¬ 
ues is given in Table El The network models were then 
validated on the spectra of these stars (see Sect.EJ. We 
only used the networks trained for the MSSGB sample 
for this test (see Sect. 0 ). The results for log g and Teff 
were obtained from network model2 with a higher weight 
(during minimization) on surface gravity and tempera¬ 
ture, while the results for [Fe/H] are from network modell 
with a higher weight on metallicity, see Sect. 15.II The 
parametrization results are summarized in Fig.EI 

From the plots we see that the metallicity results are 
quite good, with an overall error of less than 0.15 dex. 
Interestingly, the overall parametrization performance for 
log g is not as good as for the stars in M 55, despite the 
fact that the S/N of the standard stars is much higher (but 
there are only 11 stars so the statistics is rather poor). 
Given that the stellar parameters of the standard stars 
are similar to those of the main-sequence stars in M55, 



[Fe/H] [dex] [Fe/H] [dex] 


Fig. 6. Distribution of the metallicities for stars (com¬ 
bined MSSGB and RGB sample) in M 55 (left panel) and 
uj Cen (right panel) as determined by our method. The 
dashed line is centred at —1.80 dex (the literature value 
of [Fe/H] for M55) while the solid line denotes the mean 
of the distribution. There is only a small offset of 0.06 dex. 
The overall error for the determined metallicities in M 55 
as defined in Eq.0is 0.15 dex. 


and from the fact that the input spectral ranges to the 
networks are identical for both types of stars, we conclude 
that these larger deviations are, paradoxically, related to 
the larger S/N. As outlined above, the S/N of the template 
spectra with which the networks were trained influences 
the degree of regularization, thus effecting the extent of 
systematic offsets (see Sect. 0 . Indeed, the S/N of the tem¬ 
plate spectra was in the range from 10 to 100, i.e. lower 
than that of the standard stars. To investigate this fur¬ 
ther, we performed limited tests with networks trained on 
spectra with S/N values in the range from 100 to 150 while 
keeping the weight decay parameter fixed. We found that 
the systematic offset in log g for the standard stars could 
indeed be decreased in this way (albeit not completely 
removed), while the variance remained almost the same. 
While these results support the idea of insufficient regu¬ 
larization, such a strong sensitivity of the network train¬ 
ing to the S/N of the template spectra is surprising and 
needs to be investigated in more detail in future studies. 
An explanation of why there are no systematic trends for 
[Ee/H] is possibly given by the fact that the networks for 
this parameter were highly tuned, in order to eliminate 
inaccuracies. 

Taking the stellar parameters predicted by the net¬ 
work for a given observed spectrum, we can generate the 
corresponding synthetic spectrum. Eig.0shows a compar¬ 
ison between two (observed) standard star spectra and 
the appropriate synthetic spectrum produced by interpo¬ 
lating the training sample using the stellar parameters as 
predicted by the neural networks. It can be seen that the 
model spectra represent the observed spectra rather well, 
especially in the case that the network’s predictions of the 
stellar parameters are close to the literature values. 
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literature ([Fe/H]) [dex] Delta ([Fe/H]) [dex] 



literature (log g) [dex] Delta (log g) [dex] 



literature (Teff) [K] Delta (Teff) [K] 

Fig. 7. As in Fig.0but for the ‘standard’ stars showing 
the network outputs compared to the averaged literature 
values for each of the parameters [Fe/H], log 5 and Tgff 
(from top to bottom). The errors for the literature values 
are found from the scatter about the unweighted mean 
value for the different stated values (see Tab. 13, while the 
computed errors are found from 30 bootstrap networks. 
Note that the metallicity range on which the networks 
were trained was limited to values larger than —2.5 dex, so 
we cannot expect good parametrization performances for 
metallicities below this value. Excluding the single point 
with the literature value [Fe/H] = —2.95 dex, we find an 
overall average error in [Fe/H] as defined in Eq.^of 0.15 
dex, while that for log g and T^e are 0.42 dex and 145 K 
respectively. 


6.3. Discussion of the parametrization results 

The overall p arametrization perfo rmance can be compared 
with that of ISnider et alJ ll200lll . They trained networks 
on real spectral data with similar resolutions and for stars 
with similar parameter ranges as in this project. They 
reported precisions of the parameter determinations for 
log g of 0.25 to 0.30 dex and 135-150 K for Tgs while that 
for [Fe/H] was 0.15 to 0.20 dex. Our results are in absolute 
agreement with these values, even though our networks 
were trained on synthetic data. 



Fig. 8. The spectroscopically observed stars in uj Cen are 
marked as he avy dots in the Str o mgren colour magnitude 
diagram from iHilker fc Richtlej ll200nll . Based on the ra¬ 
dial velocities, filled circles were identified as members, 
while open circles denote non-members. 

6.4. Metallicity results for u) Cen 

We applied the neural networks to the MSSGB and RGB 
samples of uj Gen to determine metallicities for these stars. 
We stress, however, that the metallicities found are some¬ 
what preliminary and we do not attempt to give a more 
detailed analysis. The complex formation and evolution 
history of tu Gen will ultimately require a much more so¬ 
phisticated approach in which synthetic spectra with dif¬ 
ferent helium and/or a element abundances are taken into 
account. The results were obtained from network model 1 
which was tuned to improve metallicity performance. A 
plot of all stars observed is shown in Fig.|S| Note that 
we only considered the stars up to the top of the giant 
branch in this study (see Sect. 13, i.e. there are no HB or 
AGB stars. These will be examined in a future study. 

The overall distribution of the metallicities found by 
our neural network analysis for lo Cen is shown in the 
right panel of Fig.|3 When compared to the distribution of 
metallicities found for M 55, we note a significantly larger 
spread. From the bootstrap errors for the metallicity val¬ 
ues plus the smaller overall metallicity spread found for 
the stars in M 55 (Fig.\^, we must conclude that this large 
spread does not indicate an erroneous parametrization but 
rather a real effect of the multiple stellar populations in oj 
Cen. 

7 . Concluding Remarks 

We have shown that the metallicity information in 
medium resolution spectra can be readily assessed by neu¬ 
ral networks trained on synthetic spectra and without ad- 
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Fig. 9. A comparison of two observed standard star spectra (solid lines) with the corresponding synthetic spectra 
(dotted lines) as predicted by the neural network. The network predicts only the stellar parameters, but from these 
the synthetic spectra were generated through interpolation of the training template grid of synthetic spectra. The 
upper plot is for a star with stellar parameter residuals of A([Fe/H])= —0.04 dex, A(log g)= 0.12 dex and A(Teff)= 
—89 K, where A is the difference (network output — literature value). The corresponding network outputs are [Fe/H] 
= —1.06 dex, log 5 = 4.16 dex and Teg= 5811 K. The lower plot shows an example were the difference between the 
log g prediction of the network and the literature value is rather large. For this star, we find A([Fe/H])= —0.01 dex, 
A(log (/)=!.11 dex and A(reff)= 109 K and networks outputs of [Fe/H] = —2.15 dex, log g= 4.03 dex and Teff= 4984 
K. The S/N of the synthetic spectra was decreased to 150, i.e. similar to that of the observed spectra. 


ditional photometric information. The neural network ap¬ 
proach works in an objective way and is, ultimately, time 
efficient, relevant when large sets of spectra of stars in 
different objects (clusters) are to be analysed. In order to 
obtain some measure of uncertainty on the stellar parame¬ 
ters determined, we applied the bootstrap method to neu¬ 
ral networks for the first time for this kind of parametriza- 
tion problem. 

The estimated metallicity accuracies are of the order of 
~ 0.2 dex which is close to what one can expect from such 
data with any method. We additionally performed limited 
tests on the retrieval of log g and T^g and found that the 
overall accuracies for these parameters are ~ 0.3 to 0.4 
dex and ~ 150 to 190 K. These results could probably be 
further improved once a suitable grid of synthetic spec¬ 
tra with a range of different microturbulent velocities is 
available. In addition, the effects of a abundances should 
be taken into account to test how well this additional pa¬ 
rameter can be determined and whether neural networks 
are able to discriminate between the effects of [Fe/H] and 
[a/Fe] from normalized spectra alone. Moreover, and es¬ 
pecially with regard to uj Cen, the recent suggestions of 
enhanced helium abundances f or the intermediate metal¬ 
licity population llNorrijl2nn4 should be taken into ac¬ 
count in future simulations of the synthetic templates. As 
mentioned above, an increased helium abundance results 


in an overall increased molecular weight which affects the 
absorption lines in similar ways to log g. 

Future tests could also include photometric data (e.g. 
from the pre-imaging observations). Since temperature 
strongly affects the continuum of a stellar energy distri¬ 
bution, any colour information should improve the perfor¬ 
mance for this parameter. Once a larger set of observed 
spectra for stars in normal globular clusters (i.e. with a 
well defined metallicity) is available, we will train net¬ 
works on these data. Although synthetic spectra can re¬ 
produce observed stellar energy distributions rather well, 
we believe that a network trained on real data with simi¬ 
lar noise and detector characteristics will result in better 
metallicity estimates for previously unseen data. 
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